Matching Similarity for Keyword-Based Clustering

Authors

Mohammad Rezaei

Pasi Fränti

Abstract

Semantic clustering of objects such as documents, web sites and movies based on their keywords is a challenging problem. It requires a similarity measure between two sets of keywords. We present a new measure based on matching the words of two groups, assuming that a similarity measure between two individual words is available. The proposed matching similarity measure avoids the problems of traditional measures, including minimum, maximum and average similarities. We demonstrate that it provides better clustering than other measures in a location-based service application.
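The abstract does not spell out the matching procedure, so the following is only a minimal sketch of one possible reading, not the authors' algorithm. It assumes a greedy one-to-one pairing of the most similar words and a normalization by the size of the larger group; both choices, as well as the function names and the toy `exact_match` word similarity, are hypothetical. The minimum, maximum and average measures are included for comparison.

```python
from itertools import product


def exact_match(a, b):
    """Toy word similarity (an assumption): 1.0 for equal words, else 0.0."""
    return 1.0 if a.lower() == b.lower() else 0.0


def pairwise(g1, g2, sim):
    """Similarity of every word in g1 against every word in g2."""
    return [sim(a, b) for a, b in product(g1, g2)]


def min_similarity(g1, g2, sim):
    return min(pairwise(g1, g2, sim))


def max_similarity(g1, g2, sim):
    return max(pairwise(g1, g2, sim))


def avg_similarity(g1, g2, sim):
    scores = pairwise(g1, g2, sim)
    return sum(scores) / len(scores)


def matching_similarity(g1, g2, sim):
    """Greedy one-to-one matching sketch (assumed, not taken from the paper).

    Repeatedly pairs the most similar unused words, sums the pair
    similarities, and divides by the size of the larger group.
    """
    if not g1 or not g2:
        raise ValueError("both keyword groups must be non-empty")
    # All candidate pairs, best similarity first.
    candidates = sorted(
        ((sim(a, b), i, j) for i, a in enumerate(g1) for j, b in enumerate(g2)),
        reverse=True,
    )
    used1, used2 = set(), set()
    total = 0.0
    for score, i, j in candidates:
        if i not in used1 and j not in used2:
            used1.add(i)
            used2.add(j)
            total += score
    return total / max(len(g1), len(g2))


if __name__ == "__main__":
    same = ["cafe", "pizza"]
    for measure in (min_similarity, max_similarity, avg_similarity, matching_similarity):
        print(measure.__name__, measure(same, same, exact_match))
```

With the toy `exact_match` similarity, comparing a two-word group with itself gives 0.0 for the minimum and 0.5 for the average, while the matching sketch gives 1.0. The maximum also gives 1.0, but it would do so for any two groups that share a single word. A real application would replace `exact_match` with a semantic word similarity.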

Similar resources

Semantic Search: Document Ranking and Clustering Using Computer Science Ontology and N-Grams

Semantic similarity has become an important tool and has been widely used to solve traditional Information Retrieval problems. This study adopts an ontology of computer science and proposes an ontology indexing weight based on Wu and Palmer’s edge counting measure and uses the N-grams method for computing a family of word similarity. The study also compares the subsumption weight between Hliaoutakis a...

Measuring Similarity in Description Logics using Refinement Operators

Similarity assessment is a key operation in many artificial intelligence fields, such as case-based reasoning, instance-based learning, ontology matching, clustering, etc. This paper presents a novel measure for assessing similarity between individuals represented using Description Logic (DL). We will show how the ideas of refinement operators and refinement graph, originally introd...

Utilizing phrase-similarity measures for detecting and clustering informative RSS news articles

As the number of RSS news feeds continues to increase over the Internet, it becomes necessary to minimize the workload of the user who is otherwise required to scan through a huge number of news articles to find related articles of interest, which is a tedious and often impossible task. In order to solve this problem, we present a novel approach, called InFRSS, which consists of a correlation-b...

Evaluation of Similarity Measures for Template Matching

Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...

Publication date: 2014
